Measurement-Based Analysis of System Dependability Using Fault Injection and Field Failure Data
نویسندگان
چکیده
The discussion in this paper focuses on the issues involved in analyzing the availability of networked systems using fault injection and the failure data collected by the logging mechanisms built into the system. In particular we address: (1) analysis in the prototype phase using physical fault injection to an actual system. We use example of fault injection-based evaluation of a software-implemented fault tolerance (SIFT) environment (built around a set of self-checking processes called ARMORS) that provides error detection and recovery services to spaceborne scientific applications and (2) measurement-based analysis of systems in the field. We use example of LAN of Windows NT based computers to present methods for collecting and analyzing failure data to characterize network system dependability. Both, fault injection and failure data analysis enable us to study naturally occurring errors and to provide feedback to system designers on potential availability bottlenecks. For example, the study of failures in a network of Windows NT machines reveals that most of the problems that lead to reboots are software related and that though the average availability evaluates to over 99%, a typical machine, on average, provides acceptable service only about 92% of the time.
منابع مشابه
Failure analysis of an ORB in presence of faults
This document describes a method and experimental results for the dependability characterization of middleware implementations, and in particular failure mode analysis of CORBA ORB implementations. The aim of the work is to provide an overall approach for identifying and quantifying failure modes using various fault injection techniques and fault models. Related work in dependability characteri...
متن کاملCharacterization Approaches for CORBA Systems by Fault Injection
This document describes a number of approaches for the dependability characterization of middleware implementations, and in particular failure mode analysis of CORBA ORB implementations. The aim of the work is to provide an overall approach for identifying and quantifying failure modes using various fault injection techniques and fault models. Related work in dependability characterization of e...
متن کاملSystem Dependability Analysis using VHDL Models with Integrated Fault Descriptions 1
When injecting faults in digital systems, most approaches measures only the probability of a fault leading to a failure. This paper addresses the necessity of considering also the fault rates to get reasonable results from the fault injection experiment. For solving this problem we developed a VHDL-based fault injection tool, which allows the correct fault description and which is able to maint...
متن کاملFT-Grid: A Fault-Tolerance System for e-Science
The FT-Grid system introduces a multi-version design -based fault tolerance framework that allows faults occurring in service-based systems to be tolerated, thus increasing the dependability of such systems. This paper details the progress that has been made in the development of FT-Grid, including both a GUI client and also a web service interface. We show empirical evidence of the dependabili...
متن کاملModel-Based Fault Injection for Failure Effect Analysis
We propose a fault-injection system (FIS) that can inject faults such as read/write margin failures and soft errors into a SRAM environment. The fault case generator (FCG) generates time-series SRAM failures in 7T/14T or 6T SRAM, and the proposed device model and fault-injection flow are applicable for system-level verification. For evaluation, an abnormal termination rate in vehicle engine con...
متن کامل